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A COEFFICIENT OF LINEAR CORRELATION 
BASED ON THE METHOD OF LEAST SQUARES 
AND THE LINE OF BEST FIT. 


By J. B. CoLEMAN 


Given N points in a plane, corresponding to NV pairs of values 
for two variables, X and Y , we find the line of best fit and 
the line of worst fit, by the method of least squares*. Then we 
derive a cofficient of correlation based on the sum of the squares 
of the distances of the points from these two lines. 

The line of best fit is in the line such that the sum of the 
squares of the distances of the points from it is a minimum. The 
line of worst fit is the one from which the sum of the squares of 
the distances of the points is a maximum. We shall refer to them, 
respectively, as the minimum and maximum lines. 

For convenience we take the origin at the centre of gravity 
of the points, letting x and y _ denote deviations of X and 
Y , respectively, from their arithmetic means. 


1. The two lines pass thru the arithmetic means of the X's , 


and of the Y’s. 


y=rmx+b may represent any line of the plane. The dis- 


tance, @; , of a point, ( Xi, Y; ) , from the line is 


y;-77x,- 0 


[ft me . The sum of the squares of the distances of 






the /V points from the line will be 


*For a general discussion of this method of fitting when varia- 
bles are involved, see, Pearson, Karl, “On Lines and Planes of Closest Fit 
to Systems of Points in Space”, Phil. Mag., 6th series, vol. ii, 1901, P. 559. 
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2 Ly*+ m2 x24 Nb*-2mzxy-LbLy+Zmb=x 
QQ) fg? ——— 
1L+m* 


Using Zx=2y-=O, in the condition for maximum or mini- 


mum values in (1), the condition reduces to B=O, and the 


theorem follows. 


2. To find the slopes of the two lines. 
Kquation (1) now becomes 


(2) id*. Zy*-Lmznry + wee x* 
1+ 777 


“The condition under which (2) will have a maximum or mini- 
mum values, is that 


m*Lxy+m(Lx*-Ly®)-Lxy=O. 
This condition is satisfied hy two values of 77 , namely; 


Ly? 2 xt a (2 x*Zy*?)* +d(Z xy)* 
mM, =’ 


! eLxy 


Ly? Lx* ° [(Lx*-Ly*)*+ 4(Zxy)? 
23. —<—<—< — — ee q—: 


” LE xy 


It is found by considering the second derivative that (4) is the 
condition under which 2 q@* will have a maximum value, and 
‘3) is the condition for a minimum value. 
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The equation of the minimum line is y=77,% , and that 
of the maximum line is y-=77,x% . The value of 77, and 
™, are those given in (3) and (4). 

3. The maximum and minimum lines are perpendicular to each 
other. 

That 777, = -1/m, is easily shown from (3) and (4). 

Further 77, has the same sign as X xy, and 7» , the 
opposite sign, since their numerators are, respectively, positive and 
negative. | 
4. The minimum line lies between the two lines of regression, of 
coincides with them. 


If Zxy >O, 


m, Z y*®-Lxv*e WL x*-Ly?)*+42[x*Zy? 


kixy 


since 2 x* Dy? 2 (LD xy)* 


Hence 7, $Zy*/L xy = yy » the slope of the line of re- 
gression of x ony. 


Rationalizing the numerator of (3), and noting that 7x*Zy* 2 
(Z xy)* , we obtain 
2Zxy Dry 


7, $a *X em, 
Zu*®-Ly?+ MZ x*-Ly*%)* +4Z0x*Ly? Lx* 


i 


the slope of the line of regression of y on x . 


In the same way it may be shown that if 2 xy < O , then Ty & 


™m,< Myx: 
The condition that 77, be equal to the slope of one line of 
regression is the same that-it be equal to the other, so that the 


minimum line coincides with both lines of regression, or else lies 
between the two. 
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5. To find the sum of the squares of the distances of the points 
from the minimum line; also from the maximum line. 


let 2 be the distance of a point from the line y= 777, x 


Sa? > (a yy. ay om nyt "Tn* ; 


Substituting for 77 from (3) and reducing, we obtain 


(5) 
Ed?|ox%Ly?- WZ x*-Ly2)2 + MExy)* |/2. 


Similarly, if J) represents the distance of a point from the 
line y= 77_x , we obtain 


(6) LD*-|Zx2.Ly*+ fEx* Ey? AZ xy) \/* : | 
6. To find a coefficient of linear correlation. 
Let g-\Xae*/ZP* . Substituting from (5) and (6), 


and reducing, we obtain 


ZxLy*® 2x 2 Zy2}*+ A(Z xy)* 
£LME x*# Dy? -(Lxy)* ian 


(7b) ¢: LIZ x*Zy*?-(Z xy)? 
Lx*+Zy*+ (Zx*-Ly*)* + 4(Z xy)* 


(7a) G= 


q represents the ratio of the root-mean-squares of the distances 


of the poirt from the minimum and maximum lines. This ratio 
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is a measure of the closeness of fit of the points to a line, and 
should furnish a measure of linear correlation. The value of 7 
may vary from 0 to 1. gq = O indicates that the points are 
all on a straight line, hence that the correlation is perfect. It is 
of interest to note that when g = QO , (7b) gives Lxy/Nag, 
[=7]=+4 1 . When g is 1 the mean squares of the distances 
of the points from two lines at right angles is the same, and linear 
correlation is lacking. Hence £-g would furnish a coeffi- 
cient conforming to the customs that it have the value 1 for 
perfect correlation, and O for lack of correlation. 

Values of g found from (7a) or (7b) would involve the 
units in which X and Y are given. Hence these forms would 
be objectionable, in that g could be made to assume different 
values for the same data, by changing the units in which x and 
y are expressed. However, this objection may be removed by 
taking Oy and Gy as units in which to express X andy . 
(7b) then reduces to 


IN? -(Zixy)*/o,207 


q= ““Wiateat 
N+iZxy oF, | 


The coefficient /-g may now be expressed as 


- N- |Zxy/o, oy | 
(8) ore -_ NelZxy/oy Jl 


The sign of the coefficient should agree with that of the slope 
of the line to which the points are fitted. Hence, when the value 
for /-g has been found it should be given the same sign as 
the slope of the minimum line. But the slope of the minimum 
line is determined by that of Z xy , so that the sign given to 
1-g should be that of Z' xy. 
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The coefficient /-g may be expressed immediately in terms 
of the Pearson coefficient, r , which is equal to 7.xy/No, Dy. 
Making this substitution in (8) we have 


£-|r 
os salt at 
In the table are shown values of 1-g corresponding to some 
given values of 7 . The values for /-@* have also been listed 
corresponding to the same set of values for >. The maximum 
difference occurs between £-g and 7 when r=,839 and /-g= 
704, a difference of .135 by which L-g is smaller. 


8 
J 
6 
s 
4 
3 
2 
Jl 
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A STUDY OF THE DISTRIBUTION OF MEANS ES. 

TIMATED FROM SMALL SAMPLES BY THE 
METHOD OF MAXIMUM LIKELIHOOD 

FOR PEARSON’S TYPE II CURVE 


By Jonn L. Cartson 


The object of this paper is to study the distribution of esti- 
mates of the parameter of location for Pearson’s Type II Curve. 
estimated by the method of maximum likelihood from small 
samples. 

R. A. Fisher has assumed,’ and Professor Hotelling has 
proved’ that in large categories of cases the distribution of an 
optimum statistic approaches normality as the sample size in- 
creases. This normality has been assumed to hold for optimum 
statistics in general whether calculated from large samples or 
small ones, and it has also been assumed that optimum statistics 
have minimum variance and always give better fits than do sta- 
tistics calculated by the method of moments. That this is the case 
whenever the sample is large and the distribution of optimum 
statistics normal is made plausible by the reasoning of R. A. 
Fisher.* In case the sample is smal!, however, there may be reason 
to doubt that the normality of distribution of optimum statistics 
holds, and that the other conclusions hold. It is with this phase of 
the subject that we shall be concerned in what follows. 

Before entering into the topic under discussion it will be con- 
venient’ to review some of the more elementary facts regarding the 
curve with which we are to be concerned. We shall take first the 
general equation for the curve in the form, 


. ) P 
(1) y-y,(1- =F 





10On the Mathematical Foundations of Theoretical Statistics, R. A. 
Fisher, Phil. Trans. Series A, Vol. 222, 1922. Pp. 309-368. 
2The Consistency and Ultimate Distribution of Optimum Statistics.. 
Harold Hotelling, Trans. Amer. Math. Soc. Vol. 32, No. 4. Pp, 847-859. 
3Ibid, Pp. 328-368. 
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and determine the effect of variation of the constants. 

When =O the equation reduces to the straight line y=y, 

When p-=+#1 the equation is that of a parabola with 
y=¥Yy, atthe point x =77 andthe x intercepts at the points 
x=7+ @. Itof course meets the x axis at an angle. 

When p= +2 the equation .y=O is of the fourth degree 
in x with double roots at the points x = % @. 

In general when p= 7 the equation y=O is of degree 
Z2nin x and has two sets of 77-fold multiple roots x=+ a. 

Since our curve is to be a probability curve it will of neces- 
sity have unit area, and this fact makes it possible for us to 
evaluate y, in terms of the parameters @ and p. In order to 
lo this we shall perform the integration below. 


x4*a a 2 P 
Area <1 / y ae = by, | I-E5) ole 
Qo 


x-a 


whence 


T 
Zz 
t-Zay,/ cos*P*! 6. a2@e 
o 


but now since 


,. Vr C#) 
[es OdO=-5 7 (2g) 


Q 


. Jw (p+) 
Ll =2Zay,’5 Flo-3) 


'(p+%) 


40° oR lpr) 
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Therefore 


3 si een 
Gy avr [’(prl) 


ax 


'lp+z) : a 


Formula (2) we shall find to be of value a number of times 
and formula (3) is the form in which equation (1) will be used 
throughout the remainder of the paper. 

It will be worth while now to consider the likelihood function 
Z together with its first and second partial derivatives with re- 
spect to 777. (We shall hereafter refer to 77 as the parameter 
of location, @ as the parameter of scaling, and as the para- 
meter of shape.) We are to use 777 to denote the estimate of 777 
obtained by the method of maximum likelihood, in accordance 
with the convention introduced by Fisher,’ and it will be with this 
parameter that we shall concern ourselves in this investigation. 

We have from (3) on the preceding page 


a 
S%0+2) 7 hc 
= énieiatitanay / a am 


and so 


aL ” 2p 4; -777 
om t7]  @*-(x,-m)e 


and 
OL _ a *+ (x, -17)* 
) —* -£P es, [a 2-(x,; -77) 2] 


1Tbid Pp. 309-368. 
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At this point we shall stop to consider the effect of variation 
of the parameters @ and p upon our estimate of mm. Let us 
first consider 9. Since the method of maximum likelihood is 
here mercly the method of the differential calculus it follows from 
a consideration of equation (5) that our estimate of 777 will be 
independent of o . for any particular sample. Such is not the 
case when we consider @ . however, for any change in @ allows 
a change in the variance of 777 for the particular sample. 

We shall find it advantageous to cover as much of the the- 
oretical work as possible before embarking upon our experimental 
check and its great amount of numeral calculations, for it is only 
by means of a check between theory and experiment that we are 
able in the present state of knowledge, to judge of the applicabil- 
ity of the method of maximum likelihood to small samples. Of 
course it will be necessary to consider the distribution of our 
estimates of 77 . in order to make this statistic of practical use, 
and it is desirable to know the theoretical variances of % the 
arithmetic mean, 777, and the experimentally obtained variance of 
the distribution of our estimates of 77. The first of these we can 
obtain from theory, the second from an approximation valid only 
in the limit, and the last by means of calculation based on actual 
sampling. 

We shall be concerned first with the theoretical variance of 
x . It is well known that the variance of % is equal to the 
variance of the distribution divided by ‘77. We must first, there- 
fore, find the variance of the distribution. 

From (3) page 88iit follows that 


2 2fPrz) 
* aff T(p+/) 


2 2.? 
x2(1-23) a 
° 


o*= 


2l(er+F) 2e43 
eo 
Saal Mos” -cos Oj\d 
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remembering now (2) page 87 


2 
2 a 


o"= Fa8 
whence it follows that 


(7) oz a = 7(2pra) 


We shall now calculate the limiting form of the variance of 
777. Fisher has proved? that if the distribution of optimum sta- 
tistics is normal the variance of an optimum statistic is equal to 
the negative reciprocal of the mathematical expectation of the 
second partial derivative of the logarithm of the likelihood with 
respect to the parameter in question. 
We may write, therefore 
fe -4np/l(pr2) ° (a* x*) a ” 
Of alAl(or+t) Ss (a*-x*) [Z- ze | ” 
dials Zz whence 
“ag * af / (2c0s*” 8 - cos*” “ade 


and so again referring to (2) page 87 we have 


ae 2np (p+) 


Tf, a*(p-1) 
hence we have 


2 2a*(p-h) 
(8) Ga = np(Zprl ) 


The efficiency of the mean is then 


_ @-Lepr7) | FF 


(9) Dleprl) of 


The next problem with which we must concern ourselves is 
that of experimental verification of the assumptions under discus- 


1Tbid Pp. 327-328. 
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sion. The problem is briefly that of choosing a number of small 
samples from a population which obeys our law of frequency, 
estimating 777 for these samples, and then calculating the variance 
for the distribution. Also we shall draw an histogram of the dis- 
tribution and observe the general type of the distribution, so near- 
ly as that is possible from our samples. 

The problem of choosing our samples is not the least of our 
difficulties, for we can not take all types of samples. They must 
be of a very special nature: they must be from a population of the 
Type II. In order to accomplish this it will be necessary to have 
a table of areas corresponding to given values of x for the Type 
II Curve. There is no such table available to the knowledge of 
the writer, and it is therefore necessary to construct the table be- 
fore we can procede with the choosing of the samples. After the 
table has been built, we can with the aid of Tippett’s Tables,? 
choose our samples with ease. The manner of choosing is as fol- 
lows. Take the numbers from Tippett’s Tables as areas under 
the Type II Curve and look up in the table of areas the values of 
x corresponding to the smallest area containing the area found 
from the Random Numbers. This will give the value of x to be 
taken. Since we will take four digits let the fifth digit determine 
the sign. If it is odd take the sign - , if it is even take the sign + . 

There are two ways in which a table of this nature can be 
prepared, and the method employed must in any case be de- 
termined by the degree of accuracy attainable and the amount of 
labor involved. One of these methods is that of the calculus of 
finite differences, determining the zero order differences by means 
of algebra, and from these by the process of addition building up 
the table. This method is best used when a dependable listing 
adding machine is at hand. The other method is that of direct 
integration, and it is found that with the aid of two calculating 
machines this is by far the quicker. It was this method that was 
applied in the building of the table on page 92 and 93. 


1Tracts For Computers, No. XV. 








Constants. y, = 


me TT 
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Table of Areas Under Pearson’s Type II Curve, Correct to 


9 Places of Decimals. The Areas are included between ordinates 


* x units from the parameter of location. 





‘Ya 





0.000 000 000 
018 748 750 
037 490 001 
056 212 259 
074 920 038 


093 593 867 
112 230 292 
130 821 880 
149 361 229 
167 840 964 





186 253 750 as 2 


204 592 289 
222 849 331 
241 017 673 
259 090 168 
277 059 727 
294 919 322 
312 661 995 
330 280 859 
347 769 104 


382 326 904 
399 383 261 
416 282 613 
433 018 598 
449 584 961 
465 975 552 
482 184 334 
498 205 389 
514 032 918 





7=0Q pzk, and a-f 








Area 


529 661 250 
545 084 843 
560 298 291 
575 295 077 
590 073 828 
604 625 820 
618 947 482 
633 034 148 
646 881 319 
660 484 657 


673 840 000 
























686 943 358 
699 790 921 
712 379 067 
724 704 358 
736 763 555 
748 553 612 
760 071 688 
773 151 488 
782 281 572 


792 968 750 


803 374 696 
813 497 651 
823 336 081 
832 888 688 
842 154 414 
851 132 442 
860 009 702 
868 223 379 
876 335 911 


884 160 000 
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892 546 290 
898 944 981 
905 907 620 
912 585 318 
919 120 273 
925 092 472 
930 925 941 
936 482 509 
941 764 926 


_ | 946 776 250 


951 519 851 
955 999 411 
960 294 310 
964 182 748 
967 895 508 
971 362 202 
974 588 156 
977 579 039 


985 203 165 
987 317 441 
989 230 274 
990 949 478 
992 483 242 
993 840 132 
995 029 095 
996 059 469 
996 940 979 


998 298 304 
998 795 571 
999 186 888 
999 484 008 
999 699 102 
999 844 762 
999 934 010 
999 980 299 


980 340 865 999 997 519 


The table on the preceding page was built by direct integration 
of (3) page 88, with p=zZ and a=1 These values for the 
parameters were chosen so as to save as much labor in calculation 
as possible, and at the same time maintain the desired shape of the 
curve. There has of course been_no less of generality in setting 
a=1 but we have limited ourselves quite definitely in using the 
value p-2. The accuracy of the table to 9 places of decimals 
has been assured by calculating all values to 13 places and then 
determining the maximum error over the. whole range which was 
625 in the 13 place due to the use of the decimal equivalent of 2/5 
in the third degree term. 

Our table of areas being complete and the problem of sampling 
thus solved, we must next consider the task of estimating ™. 
Since we have already taken @= f and  =2 in building our 
tables we shall continue to use these values throughout the work. 
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The next question to be settled before beginning our work is the 
size of the samples with which we are to deal. The case 7= 1 is 
of course of no interest for the best estimate of a single observa- 
tion is the observation itself. The case 77= 2 is likewise of no 
interest for in this case the arithmetic mean coincides with the solu- 
tion by the method of maximum likelihood. Therefore it is the 
case 77= 3 with which we shall be concerned. Our results are 
not then trivial, and at the same time we have the case which is the 
easiest to deal with, since the number of numerical calculations for 
each sample is reduced to the minimum. 

Before going ahead with any attempt at solving an equation 
suchas (5) page®&%, it is well to have in mind a picture of what we 
are actually trying to accomplish. With such a picture in mind we 
are better able to realize the difficulties of the situation and so are 
better able to cope with them. To this end we have included a 
graph of the problem involved which will make clear at a glance 
just what must be done to find the true value of 777. 

We have drawn, page 96 . from plotted points the curves 
OL 
Om ° 


representing Z , aan and the three terms of Also we 


have drawn in the asymptotes to the curve which are of significance. 


OL 


Tf we are able to find the point at which == =O we have the 


Om 

solution to our problem. This involves solving a fifth degree equa- 
tion. We shall use Newton's method of successive approximation. 
I‘isher states’ that in some cases at least, we may start with an in- 
efficient statistic and by a single approximation obtain an efficient 
one. Whether or not this is the case for small samples we shall 
see when our calculations have been analyzed. 

It will be seen upon examining the graph that as__is allowed 


OL 


varies from -o to +400 , 
om 


to vary each of the therms of 


1Theory of Statistical Estimation, R. A. Fisher, Proc. Cam. Phil. Soc. 
Vol. XXII part 5. Pp. 708-709. 
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and so their sum also varies between these limits. It will be seen 
that the asymptote corresponding to the largest allowable value of 
mcan be found by adding the value of a to the smallest observation. 
and that the asymptote corresponding to the smallest allowable 
value of m may be found by subtracting the value of a from the 
greatest observation. It is thus evident that as dispersion in the 
sample increases the variance of m must surely decrease. That this 
fact is of fundamental importance in choosing our first estimate 
of m will be seen from consideration of the following case, since 
it is well known that in the case of a curve with a real finite pole 
Newton's method may lead us to erroneous results. Let us con- 
sider the sample consisting of the three observations 2x,=- .99, 
%2=# 99, x 5=+.99, for which the arithmetic ‘mean is 
%¥= + 33. If now we take as our first approximation 777, - X 

we will immediately lead ourselves to an erroneous value of 777, , 
our estimate of 7. That this is the case will be easily seen by 
means of the check given on page 94 for using @= 1 we locate 
the asymptotes at x= 7 .01. The true value of 777 located be- 
tween these asymptotes, is not therefore even to be approached 
should we take 777, =X . This is an extreme case and fortunate- 
ly not to be anticipated very often, or the arithmetic mean would 
be deprived of any value whatsoever. We should always be sure 
that the difference between any observation and the value of m 
that we are using is not greater than the value of a when dealing 
with the Type II Curve. This holds no matter what our manner 
of attack may be. 

We shall now be concerned with the calculations for our 100 
samples of 3. .\ll of the data are tabulated in such a way as to be 
self-explanatory, and so we shall not bother to give sample cal- 
culations. ‘The next ten pages cover these calculations. The dis- 
ctission is continued on page 104, 
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+17 + .254667 ~ 1.120600 
1 ~—e -.0700 |- a + .009840 |-1.051500 |- 3.184900 |-.066879 
- 1.012800 
-.0 0; 5 - 1.021059 
2 2 Kr +.0533  Jizoet +,016730 |-1.133055 |- 3.438830 |+.049737 
3 |+.34 + 312676 ~ 1.284716 
1 |+.67 + 254668. - 1.190833 
3/2 |-.11 |+.4300 |- .762281 2.573777 |\— 5.080876 |+.465022 
3 |+.73 + 329670 ~ 1.316266 
-.54 - .291230 - 1.248262 
4/2 B ke -.2700 ; ae +.018882 a — 3.655622 |-.264835 
+ 
-1.7185 
“it +.0067 |+ + 100s “gaat - 4.101439 | +.013936 
+ 347440 - 1.35029 
2 - .408 -1.479856 
6 2 ie +,0967 stone -1.001630 |-3.858903 |+.103011 
+,42 + 361036 -1.377417 
1 |+.54 41.770451 —4,424677 
7 -.50 |-.2167 |- .308021 |+.853498 |-4.277016 |- 9.978709 |-.131268 
3 |-.69 - .609932 - 1.277016 
1 +19 + 815850 -2.218101 
8|2 |-.64 |-.3700 |-.291230 |+.207992 |-1.248262 |- 4.758692 | -.326292 
3 |-.66 - 316628 - 1.292329 
-.12 + 361036 - 1.377433 
912 |-.48 |-.4433 |- .036749 |+.011911 |- 1.004049 | - 3.666198 |- .440051 
ane - .312376 - 1.284716 
1 +49 + 167774 - 1.083693 
10|2 |-.77 |-.6533 |- .118311 |+.002661 /- 1.041802 | - 3.132061 |-.652450 
3 |-.70 - .046802 — 1.006566 
1 |-.07 + 066896 - 1.013446 
1 : — -.1367 Mg 4 -.009546 |-1.098764 | -3.308885 |-.139555 
- 1.196675 
. ‘= 1.049476 
12 “72 ~.5267 - 1.119217 -.528283 
+ + O6cbe7 - 1.013434 
- - 484723 —.387058 -t so ~ 4.343556 |-.352411 
+ 104414 | 
* eo -.010034 “1 179312 -.269775 
+ 107928 
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Sam- 
ple 





eel ee el 


15 = Ea ; + 36948 + .002713 - 3.177717 |- 235846 
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In order to analyze the results of the calculations on the pre- 
ceding ten pages it is necessary that we find the theoretical vari- 
ance for X and 7 from the formulae derived for this purpose 

‘lier in this paper. 
From (7) page 90,we get after setting a@-1, o-2,and 7-3, 
de 
(10) OF “—* 047619 
and from (8) page 90 after similar substitutions 


2 - 
(11) Op = 3, .033333 


ft 
30 

From (9) page 90 we get for the efficiency of the mean when 
(12) p=2 £2.70 


Now by actual calculation from the ungrouped data the mean 
square deviation from zero which we shall designate as 


nz* 
(13) TF = .048611 
and 

ame 
(14) —7" = 047612 


Comparing (10) and (13) it is evident that such a difference 
can be said to be well within the limits of random sampling. The 
difference between (11) and (14) is of such magnitude that we 
can not say that it might be expected in the course of random 
sampling. Now since 7, is our estimate of 777 it is evident that 
either a single approximation by Newton’s method is not adequate 
to give the best results or the approximation, 


2 _a*(p-)) 
co fea) =< np (Zr ? 
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to the variance is not valid in the case of small samples. That the 
latter seems to be the case the writer firmly believes. The reason 
for this belief lies in the fact that in a subsequent case a sample of 
3 was examined by Newton’s method, and starting with 77, 
=%~= -.07 the values 77,=- .066879 and m3 = — 066791 
were obtained. There is not sufficient improvement here to cause 
one to suppose that by taking a third approximation we would 
obtain a variance in keeping with the one derived from theory. 
Also considering the mean square deviation from zero for % and 
m, it seems that the gain in accuracy to be expected from the use 
of the method of maximum likelihood solution instead of the arith- 
metic mean is not sufficiently great, in the case of samples of three, 
to warrant the additional labor involved in calculation. We must 
be sure, however, that in using the arithmetic mean we are using 
an approximation to m which complies with the qualifications given 
on pages 95 and 96. A graph of the distribution as found from 
the calculations is given on pagel06. The histogram represents 
the grouped data while the smoothed curve is a rough approxima- 
tion to the actual form of the distribution. 


Histogram Data 
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The totals have been used in the histogram which has been 
forced to be symmetrical so as to give the effect of a sample of 
twice the size. 

The tabulation of the histogram data draws attention to the 
great excess of samples having negative values for X and 7722. 
This has caused the writer no little concern. In examining the 
signs of the observations we note that there are 183-— and only 
117+ values. Assuming + and - values to be equally likely 
this gives a deviation from 150 of 33 or 3.81 times its standard 
error which is incredible. It seems therefore that Tippett’s num- 
bers are not random in this respect, and that it perhaps would have 
been better to toss a coin to determine the signs. 

As a final check and in an effort to place the type of the dis- 
tribution the value of @2 has been calculated, and found to be 

373.056 
8, was not calculated as the excess of negative signs would lead 
to an erroneous value. There is every reason to believe that 4, 
should be zero. These facts suggest that the curve is very near to 
the normal curve, but perhaps slightly more leptokurtic. But, 
why, if this is the case, there is not better agreement between (11) 
and (14) page 104 the writer is unable at the present to state. 
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A NEW THEORY OF DEPRECIATION OF 
PHYSICAL ASSETS 


By Robert E. Moritz 


1. The various methods employed for the computation of 
the depreciation of a physical asset are as many devices for re- 
covering, by means of a yearly charge to production during the 
life-time of the asset, its reduction in value. The methods differ 
according to the answers given to such questions as the following: 
Should the yearly charge be based on original cost or on replace- 
ment value? Should the yearly charge be uniform over the life- 
time of the asset? If not uniform, should the depreciation charge 
be proportional to the actual reduction in market value, or, in the 
case of new plant, should such charges be minimized or wholely 
deferred during the earlier years when the plant is trying to estab- 
lish itself? Should interest be disregarded, or should the yearly 
charges to production be accumulated with interest? If interest 
is to be considered, should the rate be the effective rate on the 
capital employed in the business, or the commercial rate? All 
these questions and others have received careful consideration.’ 

Quite as important as those already mentioned are two other 
considerations, which have been generally ignored or overlooked. 
There is first the rather obvious fact that depreciation is insepar- 
ably tied up with the question of repairs. Depreciation is greatest 
when the asset is new, when repair charges are negligible, and it 
diminishes as repair costs grow. The problem of depreciation, 
therefore. cannot be adequately treated aside from repairs. Within 
certain limits depreciation, may be compensated by repairs. The 
yearly depreciation charge to production, therefore, should not be 
based on original cost or renewal cost alone, but on original cost 
plus costs of repairs during the life-time of the asset. 

In the second place, the life-time of an asset is not a constant 


1See Saliers, E. A., Depreciation, Principles and Applications, New 
York (1922). 
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as is generally assumed, but is a variable which, like depreciation 
itself, is definitely related to the repair function. Aside from ob- 
solescence, the value of an asset could be kept practically intact 
indefinitely by sufficiently increasing the outlay for repairs. There 
is always a:threshold period of time when it is a question whether 
to scrap or to continue to repair, and frequently this threshold 
extends over a period of vears. In short, the life-time of an asset 
is generally an unknown quantity, the determination of which rce- 
auires the solution of an equation which expresses the condition 
that the annual charge to production, necessary to recover thre: 
original cost of an asset together with all repair costs, shall be a 
minimum.” 

The present paper is an attempt to treat the problem of de-. 
preciation from the point of view here suggested. The problem, 
then, is to determine the life-time of an asset such that the annual 
charge to production, necessary to cover original, cost and all re- 
pair charges, shall be a minimum. 

2. Let us denote the original cost of the asset by C. the. 
cost of repairs during the first x years by R(x), then the total 
outlay to be recovered is C + R(x). Furthermore, let U/(%) 
denote the average yearly charge to production necessary to re- 
cover the total outlay in x% years. Then, disregarding for the 
present all interest considerations, we have 


C+R(x) 


a 


(1) U(x) = 


The notation C/(%) suggests that, in general, the unit charge to. 
production will be a function of x . 

We shall now define the life-time of an asset as that value of 
x which will render the value of (//x) in (1) a minimum. 


2J. S. Taylor, A Statistical Theory of Depreciation, Journal of the 
American Statistical Association, Vol. 18 (1923), p. 1010, is, I believe, the 
first writer who recognized in part the principle here set forth. He calls 
attention to the fact that the useful life of a machine depends both on the 
manner of distributing depreciation charges and on the assumed interest rate.. 
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The analytical conditions that a function of x may have a 
minimum are that the first derivative of the function with respect 
to x be zero and that the second derivative of the function with 
respect to x be positive. If, as is customary, we denote the first 
and second derivatives of U(x) by Uc) and U(x) re- 
spectively, and those of R(x) by R(x) and FP (x) respective- 
ly, we find on differentiating 


xR (%)-C-R(x) 


? 
xe 


U (x) = 


x7 R (x2)- 22xR (1) + 2RC)+2C _R(«)-2U) 


U “(z%) = a ~ 


from which it is evident that the life-time of an asset must satisfy: 
the two conditions 


(2) 2R(x)-C-Rie):0.  R'le)>O 


In short, the life-time of an asset is given by that root of 
the equation 2x P’/z)- Rx): C which will make P'"(z) > O. 

For example, let us suppose that the repair function is given 
by the equation RP) = ax*%+bx+c. Then P(x%)=Zax+h 
RP (<)-2a, and the conditions (2) reduce to 


Azz le, a>a 


The life-time of the asset is therefore equal to /(C+c)/a 
provided the coefficient @ is positive. It is interesting to observe 
that x is independent of the constant b . 

3. In the preceding discussion no allowance was made for 
the salvage value of the asset. Let us denote the scrap-value of 
the asset after x years by S(x), then the average yearly de-. 


ee - 


ee 


a 
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preciation, interest again not considered, is 


C+ R(x)- SCe) 


(3) Ure) = 7 


and the conditions which will make U/()a minimum are 
(4) [2 )-Se)|-|C+h)-Sbe)] =O, Ce) > SC) 


If the scrap-value is a constant, both S(v)and S“(x) 
vanish, and the life-time of the asset is determined by 


(5) xP (x%)-C-R(2)+S(4) =O = P"Ce)>O 


The conditions (2) include (4) if we replace (2) by (x) 
- S(x), that is, if in the outset we diminish the repair function by 
the salvage value at time x ; to include (5) it is sufficient to re- 
place C , the original cost of the asset, by C-5, the difference 
between original cost and scrap value. With these modifications 
we may treat (2) as representing the general case. 

4. To avoid any possible confusion, let us denote by 7/x) 
the total outlay to be recovered by uniform annual charges to 
production during the life-time of the asset. Taking account of 
the residual value S(c) we see that 


(6) T(x)=C+R(%)-S(x), 7 (2) = RC) -S Ce) 
and (3) and (4) take the simpler forme 

(7) Ue) = Tle)fx, x7 (#)-Tle)=O, RC) >Sbe) 
From the Grst and second of the equations (7) ‘allows: 

(8) T (2) = U(x) 


which may be appropriately called the life-equation of an asset 
since its solution yields the life-time of the asset as defined in 1. 
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5. When the repair function and the salvage function are 
known, the real roots of the life-equation may be found either 
hy direct methods or by methods of approximation. However, in 
the great majority of cases which occur in practice the value of 
7 (x) is given only empirically, from the recorded experience re- 
lating to the asset in question, and the data available may not lend 
itself to analytical treatment. In all such cases the life-time of 
the asset may be determined approximately by means of the fol- 
lowing simple graphic method. 


Q Cc 
FIGURE 1. 
Let AB (Fig. 1). represent the graph of the equation 
y = 7 (x) , constructed in Cartesian coordinates. We shall call 
it the total outlay graph, because the ordinate y of any point 
(x, y) on this graph represents the total outlay during the period 
of x years if the asset were scrapped at the end of this period. 
The straight line. OP, joining the origin O to any point P 
on A 8. we shall call the uniform charge to production graph. 


It enables us to determine at sight the aggregate amount that must 
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be charged to production during any given period of time in order 
to recover the total outlay O/ for the time x on the basis of 
uniform distribution over the entire period x. If x is ex- 
pressed in years, the ordinate (/ V , of the point on OP whose 
abscissa is unity, will represent the uniform charge to production 
per year, which is required to recover the totai outlay for ~ 
years. 

Now it is obvious that this unit charge U/V will vary with 
the slope of the line O . It will be least when the slope is least, 
that is to say, when the point P is such that the line Of is 
tangent to the total outlay graph. The abscissa, OC , of the point 
of contact, 7 , is then the life-time of the asset under considera- 
tion. 


To determine the life-time of an asset, interest considerations 
being disregarded, we need therefore only construct the total 
outlay graph 48, then draw the tangent O7 , and finally mea- 


sure the abscissa of the point of contact 7 . 
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FIGURE 2 
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(lig. 2)shows the construction when 7 (cc) has the forms 
10 x*+ 25x + 1000, 
10 x* + 1000, and 
10 x*- 25x + 1000 
respectively. In each case the life-time is found to be 10 years, 
which verifies the theoretical conclusion of 2: that the life-time is 
independent of the coefficient of x . 

We have seen from graphical considerations that the uniform 
charge to production will be a minimum when its graph is tangent 
to the total outlay graph. This condition is precisely the condi- 
tion expressed by equation (8), which asserts that when x is 
the life-time of the asset 7 (x) . the slope of the tangent to the 
total outlay graph, is numerically the same as the yearly charge 
to production. | 

6. We now come to consider the problem of finding the life- 
time of an asset when interest at a specified rate is to be taken into 
account. In this case, the various items that make up the total 
outlay, as well as the component charges to production, must be 
replaced by their present values at some arbitrarily chosen epoch, 
as say, the epoch zero. 

Let us attempt an analytical solution of the problem. LetAt 
represent a small interval of time. The outlay during the interval 
fromt to t+Qt is 7(t+At)-T7(t) If the specified rate 
of interest is, /-, and if we represent the discount factor by 1/(/ + i) 
the conventional symbol v , then the present value of 

T(t+At)-T(t) 
at the epoch O has some value between (77 t+Op- T(t)] yt 


and [7(¢+d4t)- Tt) ot let us say 
[7/t+At)-Tt)\v 244? 


where G has some value between O and 1. The 





total outlay during the time ¢ , evaluated for the epoch O , is 
therefore 


(9) C+2 [Tt At)-Tt)|v 770% 
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‘the sum extending over all the time intervals between O and ¢. 


t+0.At , V t+6_- At 


Now v’>v where 6,,is the greatest 


among all the fractions 0. We have, therefore, 


[Tleedt)-Tee)|v* »[Tle+At)-7re)] v 04? 
(10) 
> [T(t At)-Tit)|v ** rr 


If the intervals ZA7¢ are all equal and their number 7, then 
At =t/7, and as 7 is increased indefinitely 2 + approaches 
O. Then t+@_. At approaches ¢ , and we see from (10) that 
(9) must have the same limit as 


(11) Cod [77t+At)- T(t) |v# 
To determine this limit we write 


Wevted-160)~« TEMA! as 
Ar 
where the first factor on the right represents the difference quotient 
which approaches 7 (t) as a limit as A+ approaches O as a 
limit. With this relation introduced into (11), we obtain for the 
present value at the epoch © of all the increments of outlay dur- 
ing the time ¢ , the intervals of time being infinitesimal, 


= _ 7, t)-T(t 


(12) ‘ 
cout T(t). at 


° 


In a like manner we may derive an expression for Dit), 
the limit of the sum of the present values at epoch O of all the 
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charges to production during the time ¢ apportioned at some uni- 
form rate U/ to each of the intervals 71. . The charge ap- 
portioned to the interval from t to ¢+Atis L/. At, its present 
value at epoch O is L/ v**?444 The present value of the sum 
of these amounts for all the intervals 21 ¢ between O and ¢ is 


2. U.ytt9-4At yy 


which for infinitesimal values of A¢ has the same limit as 
ZU.v*® At, so that finally 


t 
(13) Dit)= Limit Zuvtat-U/v* ae. 
At-o o 


Let L/() be the value which must be assigned to L/ in 
order to recover 7 (x) , the total outlay for x years through a. 
uniform charge to production, interest considered. Then D(x) 
must equal 7 (x), that is, 


x x 
Ue) [v* df « C+f v* T(t). dt 
oO oO 


from which 


a“ ‘ 
c+fvt T (t).dt 
(14) L(x) = . 


/ v? dt 


oO 





The life-time of the asset is that value of x in (14) which 
will make C/(v) a minimum. The derivative of Cl/(x) with 
respect to x must therefore vanish. Differentiating (14) with 
respect to z and setting the result equal to 0, we find 


w x 
v* Ta) [vt dt- loe/ vir dt) v%20, 


from which 


x 
c+l vtTtt).at 
(15) Fe eee 5 


/ vt dt 


o 
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‘which is the life-equation of the asset, interest considered. 

7. In deriving equation (13) we apportioned the charges 
to production for an interval 217 and found the sum of the pres- 
ent values at epoch O. D/(?) is the limiting value of this sum 
as the intervals 77 are indefinitely diminished. If, as is cus- 
‘tomary, no charge is made to production until the end of the 
year, this single charge will be the aggregate amount of the con- 
stituent portions for the separate intervals Z?¢, accumulated with 
‘interest to the end of the year. The charge for the interval from 
t to t+At is U/-At , its amount at rate é to the end of the 
year is L/. (1+iJ" At , where t is the time to the end of the 
ryear, and the equivalent single charge at the end of the year is 


O =hmit U (+i) At = VU. lint +i) * At 
Atzo At-o 


(16) 


“ 4 
Uf U+i* dt =H 
Qo 


8. As an example let us again take 7(t)=a¢ tht +C, 
then 


T = na je at = v tog ve = (v*1)/109 v, 
[- v’. T(t).at J v*(Lat+rbh)at - 


= [Zart +blv log v-Lalv ’ 1D) eg v)* 
(17) 


T(t) = c+|Zatv*b(v ‘| /10g v-2a(vtl log v)* 


(18) Dit) =U (vEL)/ Jog v, 
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and (15) reduces to 
clog v)* 

(19) v~lagv*=1+ ae 

While the life-equation (19) cannot be solved algebraically, 
it is evident that an approximate solution for could be ob- 
tained from a list of tabulated values of the function v ¥- /ogv~ 
When such a iable is not available, an approximate solution to any 
desired degree of accuracy may be obtained as follows: 


log v where e€ is the base of the 


natural system of logarithms. (19) then takes the form 


We may write for v, e 


(20) eI" vlogv=1+cllogv)*/2a. 


On expanding the first term of this equation into a power series in 
x. and simplifying the result, we have 


x *+ x Vlog V)/ 3 + x Vag J 12+ 2(/og v)7/60 + --: =a, 


whence 
xe C/A : 
~ 142Flog v)/3+2%(log v)2/12 + x "(log v) YOO+ --- 


Now for all ordinary rates of interest log Vv is necessarily very 
small, so that if we denote successive approximations of x by 


X,, %,, Bs, tC, 


x, -(cfa)". «le Tieg WIT | 
C/a U2 
gg Corre regeeenietnn te memneeeeerempnerenorametnamaaiapenat 
Ms | 75, (log v)/f 4 +2zl1og v) Tie | 


fees 
a lina, (10g v)/3 + xZllog v) 712 +25 (log v) Te | 
» etc. 
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Let us take the special case, previously considered in 5., when 
c = 1000, 5-0, a=10, and let the assumed rate of interest be 
6 percent. Then 


log v = -QO.098269, (logv)*= 0003 3W. 
\/log v = -17.16/ 788, I(10g v)* +294. 326 967, 


and we find 
T(t) = /0t7+/000, 
DY) +17 162 Uft-v*), 


and the life-equation is 


v~-/og v*=//09 764 
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FIGURE 3 
The first four successive approximations for z give 
x, =O, %, = 14.14, %,=//07 
44-407 
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The value x =//07 substituted in (14) and (16) give us 
Ula) =22)4F and (x)= 228.03 


This value of C/(z/ substituted for v in the oyprnion for 
DW) gives 


Dt) = 3800.5 (/-v*), 


which represents the present value at epoch C of the aggregate 
momentary charges during a period 7 at a rate such as to recover 
the total outlay 11.07 years, the theoretical life-time of the asset. 
The momentary rate is 221.45 per year, the equivalent single 
charge to production at the end of each year is 228.03. 
(Fig. 3) shows the graphs of the two equations. 


T(t) = /O00- 343 24tv*- 5890 
Di) = 3800.5 (/-v*) 


I‘he abscissa of the common ordinate of the two curves represents 
the life-time of the asset. 

9. It appears from. (Fig. 3) that at the point common to 
the two graphs, the graphs have a common tangent as well as a 
common ordinate. To see whether or not this is a general prop- 
erty let us trace the changes in the total outlay and total charge 
to production functions when interest is taken into account. 

In the first place it is evident that the increments of the 
ordinates of both of the graphs in (Fig. 1) must be replaced by 
their present values at the chosen epoch. If this epoch is the 
effect in question will be to shortefi progressively the ordinates of 
both graphs. The charge to production graph will then be no 
longer a straight line but some convex curve, while the total out- 
lay graph will go over into another graph which is less concave 
than the original graph. But both graphs will continue to rise 
indefinitely as we proceed from left to right because the incre- 
ments of their ordinates, while decreasing indefinitely remain 
positive. 
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FIG. 4 


In (Fig. 4) let 7/¢) and D(?#) represent respectively the 
total outlay graph and the charge to production graph, interest 
disregarded. 7(¢) the total outlay graph, interest considered, 
and D/t) the ‘charge to production graph, interest considered, 
through any point E on 7/¢). The ordinates on D(/¢) repre- 
sent the present values at epoch 7 of the momentary charges to 
production during time # at a rate such as to recover the entire 
outlay during the time corresponding to the abscissa of the point 
£’. This rate is measured by the initial slope of D(t), the 
slope of D(t) when 7=Q. 

Let us follow the changes in this slope for the various posi- 
tions of the point £ as it moves along 7(t) from left to right. 
It is evident that this slope at first decreases, also that it cannot 
keep on decreasing indefinitely, it is therefore plausible that it 
will ultimately increase, reaching a minimum value at the point 
P where the 7(/¢) curve and the D/t) curve have a common 
tangent. The abscissa of the point of contact, /, is then the life- 
time of the asset under discussion. 
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10. The foregoing considerations, however plausible, are 
open to objections, because we have reasoned from graphs result- 
ing from the assumption of a special law governing the repair 
function. Different assumptions might give rise to essentially 
different graphs. We shall, therefore establish the conclusions 
above arrived at, by an analytical proof, which is independent of 
any assumptions regarding the nature of the outlay function. We 
shall prove the 

Theorem: If the rate U/ of a uniform charge to produc- 
tion curve is a minimum, this curve is tangent to the correspond- 
ing total outlay curve, and the abscissa of the point of contact 
represents the life-time of the asset. Conversely, 

If a uniform charge to production curve is tangent to the 
corresponding total outlay curve, U/ is a minimum. 

To prove this theorem, let y = 7(t) be the equation of the 


at t 
total outlay curve, y= D/H) =l/ jf v Zt, the equation of the uni- 
Qo 


form charge to production curve, and x the abscissa of a point 
common to the two curves. 


2 ! za 
Then 7 (2) =D(x) -u/ vat from which 
x 
21 U=TXY/ vat 
i [ 


Since by hypothesis </ is a minimum, its derivative with respect 
to x must vanish, that is 


_ x 
(22) T(x) f vidt-v* T(2)-0 
Qo 
From (22) and (21) follows 
x 
(23) T Ce)av* Fal) | vate v*U2D Cx). 
o 


This shows that at the point common to the two curves their 
slopes are equal, they have therefore a common tangent, and since 
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U/ has a minimum value, 2» must represent the life-time of the 
asset. 

To prove the converse theorem we observe that if the two 
curves have a common tangent at the point ¢=x , 


- x 

(24) T(x) = Die) «uf vtat 
and 

(25) 7 (2) =D (a) = VXU 


Substituting the value of C/ from (24) in (25) we find 


_ _ x 
F(z) =v* T(a)/f vidt 
from which 


x) |v tat v~ T(x)=0. 


But by (22) this is precisely the condition that U is a minimum. 

11. In most cases which arise in practice the analytical 
method of finding the life-equation of an asset fails owing to the 
empirical character of the outlay function. The question suggests 
itself whether a graphic method, similar to that employed in the 
simpler case treated in 7, can be devised, which will yield an ap- 
proximate solution of the problem. The theorems of the pre- 
ceding article offer the key to such a method. 

I.et us suppose that the total outlay graph has been con- 
structed on a convenient scale. the scale depending on the magni- 
tude of the quantities involved. Every point on. this curve 
determines a definite uniform charge to production curve. We 
seek that particular one of these curves which is tangent to the 
total outlay graph. The abecissa of the point of contact would 


| 
+ . 
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give us the life-time of the asset, and the initial slope of the uni- 
form charge to production curve would give us the rate 

Instead of constructing first the total outlay curve, we may 
first prepare a sheet with rulings, as shown in (Fig. 5), each rul- 
ing representing the uniform charge to production curve cor- 
responding to a definite L/ , the successive values U/ being chosen 
at suitable intervals. We may then plot the graph of any given 
outlay function on this same sheet and from among the various 
rulings select that on which comes nearest having contact with the 
outlay graph. This will yield a first approximation of x. A 
closer approximation may then be obtained by the usual processes 
of interpolation. 


Pehent Ceitond, 











THE SIMULTANEOUS DISTRIBUTION OF MEAN 
AND STANDARD DEVIATION IN 
SMALL SAMPLES 


By ALLEN T. CRAIG 








1. Introduction. If samples of 77 items are selected at 
random from a normal universe, it is well known that the arith- 
metic mean X% and standard deviation s computed from samples 
are independent in the probability sense and that the simultane- 
ous frequency distribution is 






_ns*+ 7x” 
Paw+Cae**e@ FF 











If, however, the parent population is other than the normal type. 
there appears to be little known regarding the form of -' (2%, s). 
In the present paper, we propose to determine the simultaneous 
frequency function of the arithmetic mean and standard deviation 
in samples of small numbers of items selected at random from a 
rather arbitrary universe. For convenience, we shall classify 
frequency distributions according as the range of the independent 
variable is (-00, 0) (O a) or(2a)a70Q We shall further 
assume that the total area under the distribution function is unity. 
2. The simultaneous distribution of % and s in samples 
of mek. Let £(%);-00<%x <0 be the frequency “unction of 
the variable « . Let x, and x, be two independentl. erved 
values of x . write 

















X,+X%,= 2X 


af «mS ete*s ta” 
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We seek the function A’ ( Z, s) such that F(X, s)d¥%dSsis, 
to within infinitesimals of higher order, the probability of the 
simultaneous occurrence of X in (%,Z%+dx%)and's in/ss+ds). 
For X% and s assigned, x, may have either value Y-S_ or 
x +S and x, is uniquely determined by x, - £X-x,. 


F(z,s)dzds = f(#-s) f(2x-x,)d%, dxz 


‘Thus 


th(Z+s)f(LX-X, )adx, Wx, . 


Since Ax, Cx,-Ld%ds we have 


(1) F (Zs) = 4f(Z-5) f (X+s). 


If #(2) is defined on the interval (O, co) we note, for X 
assigned, thats < ZX. Thus (1) is valid for this type of frequency 
function but the surface is limited by the x -axis and the line s=X% 

If f(x) is defined on the interval / O a), we note, for 

ZX assigned on (O, 2/2) that s< Z ; and, for % assigned on 
(2/2, a) that s¥ a-X. Accordingly, for this kind of frequency 
function, (1) is valid but the surface is limited by the x -axis and 
the lines s=X,s=@-X. 

As simple illustrations, let us find the correlation surface for 
the mean and standard deviation of samples of two items drawn 
from distributions of various types. 

Example 1. Let 


x= 


Fu)= rae e 2 ~ 00< X <6o 
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Then 
2 _57+x* 
F(X, 5)= saz e =| 
the well known result. 
Example 2. Let 


xz 


Fixjze°, OS xX< 0. 
Then _ 


2X 


F(H,5)=4e 


over the open region of the X¥s-plane bounded by the X -axis and 
the line s=X. 
Example 3. Let 


F(x)=4, Ofx¥4a. 


Then 
sa al 
FCZ,5S) = ae 


over the region of the %S-plane bounded by the isosceles triangle 
with sides SO, S=X% and S-@-X  . Witha uniform dis- 
tribution proportional to 4/a2 over this triangle, it follows in- 
cidentally from very elementary geometry that the marginal totals. 
of the distribution of % are given by the known values 


QR 


S2z 


NTS 


Diy) - Aa zx, 


4 " a 
= 52 (a-z), 24x <a, 
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and that the marginal totals for the distribution of S are given by 


? 


W(s)= & (a-25), o<s<2 


which is the result given by Rider.’ 

3. The simultaneous distribution of % and s_ in samples of 
7=3 . Consider first a frequency function f(x), -a<x<a. 
We have 


Upon eliminating Xz, we have 


a _ = sn 
22, +R, Xy thxZ-OXX, - 6x, -5*+6z*%O, 


rom simple properties of this ellipse, it follows, for assigned X 
and § that x, may be chosen arbitrarily from the interval 


(x-s v2, z+sZ ). With x, assigned, x z must be selected with 
certainty as either 


3Z-x, -|6s* 3 (x, -x)4]? 
ee 


or 
3z-x,+\0s?-3(x,-Z)?| Z 
| ee 


Finally we must have 


Xs =J7x-%,-XqQ. 


'P, R. Rider, On the distribution of ratio of mean to standard deviation 
etc., Biometrika, vol. 21 (1929) pp. 124-141. 
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Thus 


x+SVZ 
F(X, s)dx ds = ef. fd tla tle, dx, dx, dx, 


From 


XxX, =X t 


1? z 
- 32-x,t|65*-3(x,-%)*] s 
> 2 


we obtain 
ds 
ax, aX, ax, = 6 s2-7x,-z)*E dud ds. 
=O9sdx,dzads/R 


where 


R= [6s* F(x, - z)2)# 


(2) 


z+sl2 = y-y- 
F(z,5) = 18s [ i Bt (x,) t( FE") (Dy 


x-S 
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Ii f(x) _ is defined on the interval (O, co), we note, 
for X assigned, that OS s <%2 Thus the surface is limited 
by the % -axis and the line s-x/2. Moreover, since X,,%z, 
%3 are non-negative, x, may be selected from the interval 
(2-sv2, x+s/Z) only as long as s< X% V2/2. If 

% 2/2454 XA. 
then x, may he selected from the intervals 


( 3z-los2- 322] 


Zz 
and 
( 2eeleen 22° e ers) 


\ccordingly, for this type of frequency function, 
Z+5V2 . ji 
= FX ~ 3X -x, - 
F(%,s)= 18s [ £ #(x,) (28 *) A =a Jiahes,, 
Z- si 
X Ve 
Oss¢E —s 


(2.1) 


Z 
3z -l6s% zz" Z+sk 
Jz 16S Jz) 
=i8s / +h caost. pa 
2246 5"- 324) 


O 


1 3x-x,+R FX-x,-F 
= f(x,) t (22 AE ) (22) ax ; 


Xx 


Bo) 


<s<x&. 





k 
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I 


mh 


F (x) is defined on the interval (O a ,), we note: 
* 


for OL XL a/F, OsstXVZ; 
af 
a\2 
for 2a/F< 2 < 2Za/Z, Ozs<s [2z2 20% “¢ | , 


for 2a@/345 Zsa, O£s < (a-z) JZ. 


‘hus in this case, the surface is limited by the x-axis, the lines 
s=x% /Z and s=(a-%)/2 nd the hyperbola 
s2[2z% 2az+20°/F)*. 

(Fig. 1.).. Now «, may be selected from the interval (#-s/Z, 
x+s/Z)as long as ss z/2/2 and s<(a-z)/Z/Z2 . This 
holds for that part of the surface over the region bounded byOF@. 
!‘or that part of the surface over the region bounded by OPL, x, 
may be selected from the intervals 


(o, 3%-[65*.3z*]* ) 


z 
and 
s 2 32]? 
Ss 2Etles3e z= ,2+sf2) 
U V 


8 
We 


aa 
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It is clear that the ranges of arbitrary selection of x, for that part 
of the surface over the region bounded by PW are 


( ;- af 3z-a -[6s*-3/a-x)]* 


Z 

and 
( 3z-a+[6s*3(e-z)*|* a) 
Zz ores 


l‘inally, we find that 2, may be selected from the intervals 


( 3z-a-[6s?- 3(a-z) T’) 


0, Z 
Z Z 
( 2£-a+l6s "3-271 3x-[6 s*-3z%*]* 
a az 


and 
3z+los*3z7]*% 2 
2 ? 
for that part of the surface over the region bounded by PLU/V . 
Ii we adopt the notation 
ZxX- HE) (PF -R 


P=Hx,,%, s)= pe) 3 
we have 


Z+sJ2 
(22) F(¢s)=1Bs / Pdz,, 
Z-s (2 
xz 2_3z z+s/2 
10 / 2#lget-2051 / |e 1 
t 2x46 s*_3 zz 
3x -a-[6s*. x(a-2)*I* 
-104/ - 
X-s/Z 


a 


fecal sah P* 
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4 
3Z-a [6 s#.3@-z)#i* 3X -b6s%.3 z2|z 
2 2 

& 


=18 + 4 
3X-a 16 s2%.3@-z)* |? 
Z 


-/ | Padx,, 


ys 
32z+(6s2% 3xz2]2 
Z 


for the parts of the surface over the regions indicated above. 


In order to illustrate the theory, we shall consider a few ex- 
amples. 


Example 1. Let << 
1 Zo 


Mx) =F Bre » ~ OOK X<o, 
By (2), 35% 3x7 
_.) BB ee 
F(%,3)= SBF se 


Example 2. Let 


f(x)Jze~, Ofx<oo. 


By (2.1), 
F(z,s)-O[3rse OSS ze 
[6s*. 3x2)* 
_3z z-[6s*- 3x x 
= 6/3 se “are sin  s2Zjfo FOIE IM OF 


&. 
neleen Ze) ,2', we 
-arce sin s2fz 212 


Example 3. Let 


f(x)=£, CS zz ia. 


ET 
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By (2.2), 


F(Z, 5)= $278 , over OPa, 


oi lle 


Ss x . x 
lare sin 2 Sze t+ QC =u 





_6Ps 
a? 
-arce SN X+[6s2_ 3248 


ZT 
S22 * Z| over OPU, 


- ois ler sin 2-2 Nos*-Ha-x)*V* 2 ese z)*] 


Z-% 2-a+l6s*.32-2)** 
tle SLITS & - arc sin safe 


+ I, over PvVa, 


_6/3s _ £-a-[6s-3(e-z 212 
= 2B 8| arc S277 8p 


+ arc sin z +$are sin z-[ost 324]? 


‘7 Q-xX 
+ arc Str Sir 
S 


g-a+|pst%3 )2 2 
S22 


> 2 <<21% 
| re ade X+los%3z 


s2/2 ,over PV. 
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I have succeeded in obtaining the marginal totals for s 
from O to al2/A by integrating /(Z,s) with respect to X 
from the boundary (Fig. 1) s-x/2 to s=(a-%)/& and ob- 
tain as a result the parabola which is known! to give the distribu- 
tion of Ss from s=O to s = 2ff6. 

4. The simultaneous distribution of X and Sin samples of 
m= 4 . We shall consider first samples of four items drawn from 


a universe characterized by a law of frequency F(x) -oo« x< 0, 
Then 
xX, +Xo +s E P 
a 2 2 2 a = 
x4 ep eel rey =4s*+ 4x”. 


+X, = 4X, 


The elimination of x, yields 

eo. . - - 
Mp +X, AZ 4U HZ 4H, +¥,X, AEX, -42x,-4ix, -235*+6z77=O 
It follows from the properties of this ellipsoid that 2», may be. 


chosen arbitrarily from the interval (z -s/3 Z+s ba ). For x, 
assigned, the region of arbitrary selection of x, is determined by 
the properties of the ellipse and is 


L £ 
4Axz-x,-h [6s*-2(x,-z)** AZ-x,+Z [o9*-2(x,-2)}* 
3 , 3 

Upon solving for x, in terms of x, and x, we have 


— 4X -x,-x 2 +8 s*-8z7+8zx, +BXX,- 2x,x,-Ix2 3x4]? 


’ Rk 
while x, is uniquely determined by %, =4X-x, -X%,-%,. 
If we write 


L 
T 2(88?-8x?+ 82x, +8%x,-2x,u,-3x,7 -3x2]2 


and 


Z-xX,- yr ¥ -%,-X%y-T 
D2 flu, lu,)t( = Ms Nat «HL i ) 


1H. L. Rietz [Paper to appear presently in Biometrika]. 


es 


TET 


mR CE ee oe een on cn 
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then 


avl~ 


(3) Zes/F . 4¥-% +216s22(x,-z)*] 1 
Fe s)-3es/ 3 ) F 9 dx, dx, 
Z-5/5 /42-z,-2165*2(u,-¥)* |" 

3 ‘ 





The integration can be carried out in an obvious manner when 
f(x) is the normal frequency function. 

In case *(x%) is defined on the interval (QO, a) we note, 
for X assigned, that s< X /3 . Thus the surface is limited by 
the X-axis and the line s = Z/% Moreover, x, may be 
selected from the interval ( z-si3, z+9/F ) with x, chosen 
as above only as long as s< % [3/3 If </9/3<5< X, 
then x, may be chosen from either of the two intervals 


O 42-2[6s4%-2z2]2 
3 
and 
a “ont 
( #228l6s -2 27 55/5 


with x, chosen as above; or x, may be selected from the interval 


&. 
rece tialogneet]”) 
a — ee 


with x, taken from either ‘ 
(2 4%-x,-[8s*-82% 3x,7+8z x]? 
, Rk 


or 4 
(fe-#- $?-62*.3%,2+8%x)* 4%-x,+2[657-2(x, =) 
’ Pt a ey 


when % < s <Z/F_ we may have 


O<x,<2z-[est-227]2 
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4z%+2[6s*- 2x7] Z 


! Zz 


IA 











at 
2z%+(287 -2%7|Z<x 
with either 


£ 
Of x% < 4A Z- x, -[8s*- 8z“- 3x2 +8zx,] « 
ae 






or 


/ 
= ™ - = — 2] 2 
4X-X,H85*-8 27-3 x7 +BX “TD ex AX-x,+2|65-2(x,-%)*) 
x z 


Or we may have 


4 


2 Zz 
4i+2[6 57-22% *J £X,< X+s/3 


3 










with 


aie if. 
4%-x,-2[687-2(x,-%)*\* . .42-%,+2[697-2(x,-%) ™ - 
a ee a 2s 5 






Accordingly, for this kind of frequency function, , 


+53 pAaxX-xX iis 2(x,- zy } 
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140 SIMULTANEOUS DISTRIBUTION OF MEAN 


By similar reasoning, the writer has determined /'/%, s) 


for 7=4 and t(x) defined on the interval (O, a). 


The results, however, are quite lengthy and formal and will not 
he presented here. 


Bataan F Camas 








THE LIMITS OF A MEASURE OF SKEWNESS 


By Haroip Horen.ixe and Lfonarv M. Sotomons, Columbia University 


The measure of skewness 
mean -- median 
standard deviation 

is sometimes recommended because of its simplicity. Obviously 
neither this nor any other statistic can be of much value yntil 
something at least is known of its distribution in samples from 
populations of some plausible form. For populations near the 
normal form the inefficiency of the median as a statistic of location 
suggests that the standard error of § may be considerably greater 
than that of "as. We know of no investigation of the sampling 
distribution of s . .\pparently even the range is unknown. The 
object of the present note is to show that g necessarily lies be- 
tween -1l andl. 

The proof consists of three successive transformations of the 
sample, cach increasing gs . which nevertheless in the end remains 
less than unity. 

1. Without loss of generality let us suppose that the median 
is zero and that the mean % is positive. Taking 
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7 \eing the number of observations, which we suppose odd, we 
have 


s=z/c. 


ii 2 negative observation -@ he replaced hy zero, the mean 
is increased hy a@/77 . In the second of the expressions above for 
or the mean of the squares is diminished by an , while on ac- 
count of the change in the mean. a further subtraction is made. 
Thus o diminishes. Hence g increases if we alter the distribu- 





